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Abstract 

Background: Ydjl and Sisl are structurally and functionally distinct Hsp40 proteins of the yeast cytosol. Sisl is an 
essential gene whereas the ydjl gene is essential for growth at elevated temperatures and cannot complement sisl 
gene deletion. Truncated polypeptides capable of complementing the sisl gene deletion comprise the J-domain of 
either Sisl or Ydjl connected to the G/F region of Sisl (but not Ydjl). Sisl mutants in which the G/F was deleted 
but G/M maintained were capable of complementing the sisl gene deletion. 

Results: To investigate the relevance of central domains on the structure and function of Ydjl and Sisl we 
prepared Sisl constructs deleting specific domains. The mutants had decreased affinity for heated luciferase but 
were equally capable of stimulating ATPase activity of Hsp70. Detailed low resolution structures were obtained and 
the overall flexibility of Hsp40 and its mutants were assessed using SAXS methods. Deletion of either the G/M or 
the G/M plus CTDI domains had little impact on the quaternary structure of Sisl analyzed by the SAXS technique. 
However, deletion of the ZFLR-CTDI changed the relative position of the J-domains in Ydjl in such a way that they 
ended up resembling that of Sisl. The results revealed that the G/F and G/M regions are not the only flexible 
domains. All model structures exhibit a common clamp-like conformation. 

Conclusions: Our results suggest that the central domains, previously appointed as important features for 
substrate binding, are also relevant keeping the J-domains in their specific relative positions. The clamp-like 
architecture observed seems also to be favorable to the interactions of Hsp40 with Hsp70. 



Background 

Molecular chaperones are proteins that are involved in 
assisting the folding and assembly of newly synthesized 
proteins recognizing non-native substrate proteins pre- 
dominantly via their exposed hydrophobic residues [1]. 
However, the conditions for the successful folding in 
vivo are not always favorable. The cellular environment 
is crowded and thus protein denaturation and aggrega- 
tion will be major problems. Thus, there is the need for 
chaperones that also protect cells from elevated tem- 
perature or other cellular stress situations, to achieve 
successful folding of proteins in vivo. There are several 
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families of Heat Shock Proteins (HSPs), each family acts 
to assist protein folding in a different way. 

An important chaperone family is the 40-kDa Heat 
shock protein (Hsp40). Chaperones from the Hsp40/ 
DnaJ family play important roles in cells by working 
together with molecular chaperone Hsp70 to promote 
protein folding, assembly, translocation and degradation 
[2-5]. Hsp40 proteins can interact with the hydrophobic 
side-chains of non-native polypeptides preventing aggre- 
gation [6,7]. Hsp40 can then form transient complexes 
with Hsp70 presenting non-native polypeptides for sub- 
sequent protein folding [8-10]. The members of the 
Hsp40 family typically contain a J-domain, which regu- 
lates the ATP-dependent binding of peptides by Hsp70 
[4,11,12]. 

Members of the Hsp40 family act as molecular cha- 
perones to bind and deliver non-native proteins to 
Hsp70 and can be divided into three main groups, from 
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which two: Type I and Type II are the most studied. 
The two types are not functionally equivalent [13-15] 
and exhibit major differences in chaperone activity [16]. 
In both types the J-domains are connected to the central 
and C-terminal domains via a G/F-rich (Glycine/pheny- 
lalanine-rich) linker (Figure 1). 

Type I Hsp40s, such as Escherichia coli DnaJ, yeast 
Ydjl and human Hdj2, contain a zinc-finger-Iike linker 
region (ZFLR) between the G/F domain and the C-ter- 
minus (Figure IB), and Type II Hsp40 proteins such as 
yeast Sisl and human Hdjl contain a G/M-rich (Gly- 
cine/Methionine-rich) domain in the linker region (Fig- 
ure lA) [14-17]. Both types have a substrate-binding site 
located at their C-terminal domain, which is divided in 
subdomains I (CTDI) and II (CTDII) (Figure 1). 

The reason why Type I and Type II Hsp40s exhibit 
differences in chaperone activity is unknown. Some bio- 
chemical and structural studies have already provided 
some insight into this question, suggesting that the 
answer resides in their structural differences [15,18,19]. 
Hence, the need for a comprehensive study of their 
structure is of considerable importance. 



High-resolution structural studies with fragments of 
Sisl and other Type II Hsp40s indicated that these pro- 
teins function as homodimers that have a clamp-like 
architecture and use a shallow groove located on the 
surface of monomers to bind non-native proteins 
[20,21]. In addition, the G/F-rich regions of Ydjl and 
Sisl lie adjacent to their putative polypeptide binding 
domains and they seem to specify the functions of these 
Hsp40s [22-24]. These structure/function studies 
together with hydrodynamic analysis showed that both 
types of Hsp40 proteins form dimers in solution 
[6,18,19]. Dimer formation plays a critical role in 
Hsp40s chaperone activity because disruption of the 
dimerization motifs results in severe defects in both cha- 
perone functions [20,25]. 

Previous results showed that human and yeast Type I 
and Type II Hsp40s, have distinct quaternary structure 
[18,19]. These results raised the hypothesis that the cen- 
tral domains controlled the quaternary structure of both 
types of Hsp40s, because in chimeric mutants, in which 
the central domains of Ydjl (ZFLR) and Sisl (G/M) 
were switched, their properties were exchanged. A 
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Figure 1 Domain organization of the Hsp40s and mutants. (A) Schematic representation of Sisl domains from known liigh-resolution 
structures. (B) Schematic representation of Ydjl domains from high-resolution structures. 
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chimeric Ydjl in which the ZFLR had been switched by 
the G/M from Sisl proved to be functionally and struc- 
turally similar to Type II Sisl. Correspondingly, a chi- 
meric Sisl in which the G/M had been switched by the 
ZFLR from Ydjl proved to be functionally and structu- 
rally similar to Type I Ydjl [15,19]. 

To increase our knowledge on the role of the central 
domains in the structure/function relationship we 
defined the biophysical and functional features of Sisl 
and Ydjl mutants with deleted central domains that 
were specific for Type I or Type II Hsp40s. These stu- 
dies entailed a functional analysis of mutated Hsp40s 
coupled with biophysical investigation of the quaternary 
structure by dynamic light scattering, analytical ultra- 
centrifugation and small-angle X-ray scattering (SAXS). 
A refined solution structure of Sisl was also obtained 
using improved SAXS data. The outcome of these stu- 
dies showed that deletion of either the G/M or the G/M 
plus CTDI domain regions had minor impact on the 
overall quaternary structure of Sisl. Consequently, our 
results suggest that the central domains are important 
for substrate binding and maintenance of the J-domains 
in their specific relative positions. 

Results and Discussion 

Protein samples 

Sisl is a yeast member of the Type II Hsp40s family and 
contains 352 residues arranged in a highly conserved a- 
helical N-terminal J-domain, a disordered middle region 
(divided into glycine/phenylalanine (G/F) and glycine/ 
methionine (G/M) rich regions) and two C-terminal 
sub-domains (CTDI and CTDII) as shown in Figure lA. 
In order to understand the role of the central regions in 
the structure and function of Type II Hsp40s, two Sisl 
deletion mutants were produced: Sisl_Ai24-i74> from 



which the G/M region had been deleted, and Sisl_Ai2i- 
257, from which both the G/M and the CTDI had been 
deleted (Figure lA). Ydjl is a yeast Type I Hsp40 that 
contains 409 residues also arranged in a highly con- 
served a-helical N-terminal J-domain, a disordered mid- 
dle region (a glycine/phenylalanine (G/F)), a Zinc finger 
domain (ZFLR or Cys-rich domain) and a two C-term- 
inal sub-domains (CTDI and CTDII) as shown in Figure 
IB. Again, to understand the role of those regions in 
that protein structure, we also studied a Ydjl deletion 
mutant that we named Ydjl_Aio6-255> which both the 
Zinc finger-like region (ZFLR) and the CTDI had been 
deleted (Figure IB). The mutants were purified with no 
apparent contamination (>95% pure; Figure 2A) and 
maintained at 4°C to avoid degradation. The folded con- 
formation of the proteins was investigated by circular 
dichroism (CD) spectroscopy (Figure 2B). As previously 
shown for the wild- type proteins Sisl and Ydjl [19], the 
mutants had CD spectra of well folded proteins with 
minima at about 208 and 220 nm and a positive peak 
bellow 200 nm (Figure 2B), The shapes of the spectra 
indicated that no large unfolded portion was present. 
This is in good agreement with the results from hydro- 
dynamic measurements (see below). 

Hsp40 function 

Hsp40s act by binding an unfolded or partially unfolded 
protein (client protein) delivering it to Hsp70 and con- 
comitantly stimulating the Hsp70 ATPase activity. ATP 
hydrolysis by Hsp70 is a crucial step in protein folding 
assisted by this chaperone. The effect of the deletions 
on the function of the Hsp40s was assayed by testing 
both the ability to bind a client protein and the stimula- 
tory effect on the ATPase activity of Hsp70 (Figure 3). 
First, the ability to bind heated luciferase (a client 
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Figure 2 Results from protein expression: (A) SDS-PAGE 12% showing that the recombinant mutant proteins were > 95% pure. Molecular 
masses for standard proteins are shown on left. (B) CD spectra of recombinant proteins (10-40 |jM) were measured from 195 to 260 nm in 25 
mM Tris-HCI (pH 7.5) containing 500 mM NaCI and are shown as molar residual ellipticity ([©]). 
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protein) was tested and the Sisl binding was set as stan- 
dard (100%). Compared to Sisl, the efficacy of 
Sisl 

_Ai24-i74> of about 60%, and both Sisl_Ai2i-257 
and Ydjl_Aio6-255 were of about 40% (Figure 3A). Sec- 
ond, the mutants were assayed regarding their abiUty to 
stimulate ATP hydrolysis of Ssal (Hsp70) and the Sisl 
stimulatory effect was set as standard (100%). Sisl_Ai24- 
174 and Ydjl_Aio6-255 had effect similar to that of Sisl, 
inside the error, and the effect of Sisl_Ai2i-257 was of 
about 90% (Figure 3B). For comparison, the perfor- 
mance of Ydj 1 in both experiments was similar to that 
of Sisl, within experimental error (data not shown). The 
results show that while the deletions decreased in about 
50% the ability to bind client proteins, they seem to 
have no effect on the ability of the J- domain in interact- 
ing with Hsp70. These results suggested that the J- 
domains maintained their proper conformation and 
functionality. 

Hydrodynamics 

Sisl and Ydjl are dimers in solution and here we used 
analytical ultracentrifugation (AUG) to investigate the 
oligomeric status of the deleted mutants. We performed 
AUG sedimentation velocity (SV) experiments and fitted 
the data using SedFit that supplied continuous sedimen- 
tation distribution c(S) (Figure 4A). From the maximum 
of the peaks of the c(S) curve, the apparent s, which was 
corrected to standard conditions (s2o,w) ^^id plotted 
against protein concentration (Figure 4B). The extrapo- 
lation of S2o,w to 0 mg/mL gave s^2o,w which is an intrin- 
sic property of the protein and contains information 
about both the molecular mass (M) and the asymmetry 
of the molecule. Normally, a variation in the value of 



s^2o,w induced by external factors (pH changes, salt 
strength, ligands or temperature) is related to conforma- 
tional changes [26]. The values of s°2o,w and D^2o,w> 
obtained from dynamic light scattering (DLS) experi- 
ments, are shown on Table 1. The M values obtained 
from both the c(M) distribution and the s/D ratio 
(Equation 4), and the weight average factor ///o are also 
shown (Table 1). Our results suggested that Sisl_Ai24- 
174, Sisl_Ai2i-257 and Ydjl_Aio6-255 are also dimers in 
solution and they have an asymmetric or elongated 
shape as shown previously for Hsp40 proteins [18,19]. 

SAXS results 

To explore the impact of selective central domains dele- 
tion on Hsp40s quaternary structure, SAXS analysis was 
performed. The corrected and normalized experimental 
SAXS curves for the proteins Sisl, Sisl_Ai24-i74> 
Sisl_Ai2i_257> and Ydjl_Aio6-255 in the q range 0.01 < q 
< 0.25A'^ are displayed in Figure 5(A), together with the 
respective regularization fitting (solid lines) resulting 
from the p(r) function calculation using GNOM. The 
corresponding p(r) functions, shown in Figure 5(B), indi- 
cated a slightly elongated shape for all the proteins in 
solution, confirming the AUG results. The Kratky plots 
presented in Figure 5(G) showed that those proteins are 
all quite flexible in solution, which gave us a clue about 
the difficulty in crystallizing them. An inspection of the 
Porod plots in Figure 5(D) shows that the curves do not 
contain a significant plateau region indicating well 
defined particle volumes, but the plots for Sisl and 
Sisl_Ai24-i74 present a fairly flat region in the q^-range 
0.0012<q^<0.0025A'^, suggesting a more compact con- 
formation for those two proteins. The complete 
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Figure 4 Results from analytical ultracentrifugation (AUG) 
sedimentation velocity (SV) experiments. (A) The Figure shows 
experiments using 1,000 |jg/mL of protein concentration in 25 
mmol/L Tris-HCI (pH 7.5) containing 500 mmol/L NaCI. (B) The 
software Sednterp was used to correct the apparent s to the S2o,w 
Proteins s°2o,vv values were calculated from the plot of S2o,w versus 
protein concentration and are shown in Table 1. 



deviation from the Porod regime leads to predict more 
flexible structures for the Sisl_Ai2i-257> and Ydjl_Aio6- 
255 proteins. The degree of flexibility of each individual 
protein is thus confirmed from these plots. The inset in 
Figure 5(A) displays the ln(I(q)) vs. plot within the 
validity region for the Guinier approximation (qRg<l) 
together with the corresponding linear regression for 
each protein. The linearity of those plots confirmed the 
monodispersity of the samples. The maximum dimen- 
sion (Dmax) values, the radii of gyration (Rg) obtained by 
Guinier approximation and from the p(r) functions, as 
well as the calculated and estimated M plus the oligo- 
merization states of all the proteins are presented in 
Table 2. The Rg values obtained by the two approaches 
are in close agreement. The most reliable Rg value for 
each molecule is that obtained from the p(r) function, 
derived from the complete experimental curve. As 
described in Material and Methods, a bovine serum 
albumin (BSA) solution was used as reference sample 
for the M estimation using the SAXS data. The results 
indicated that all the proteins exist in a dimeric state in 
solution, since the values obtained are approximately 
twice the values calculated from the primary sequence 
(Table 2). 

Ab initio and rigid body model calculations based on 
SAXS data 

Two ab-initio computational routines were used to cal- 
culate the low resolution models for the molecular 
envelopes of the proteins Sisl, Sisl_Ai24_i74, Sisl_Ai2i_ 
257, and Ydjl_Aio6-255 proteins. The Dummy Atoms 
(DA) and Dummy Residues (DR) models were derived 
from the X-ray scattering data introducing a 2-point 
symmetry constraint in the calculations. This assump- 
tion was based on the solution scattering data, which 
indicated that all the molecules under study were dimers 
in solution. The crystallographic structure that identified 



Table 1 Summary of the hydrodynamic data obtained from DLS and AUC experiments. 



Hydrodynamic properties Protein 





Sisl 


Sis1_Ai24-174 


Sis1_Ai2i-257 


Ydj1_Aio6-255 


Predicted M (kDa)* 


75.2 


66.1 


46.8 


56.1 


Experimental M (kDa) 


64 ± 3^ 


58 ± 2* 58 ± 1^ 


45 ± 2* 43 ± 1^ 


54 ± 1* 55 ± 2^ 


5°20,w (S)* 


3.5 ± 0.1 


3.4 ± 0.1 


3.0 ± 0.1 


3.6 ± 0.1 


Ssph (S) 


6.5 


5.9 


4.7 


5.2 


D2o,w (10"^cmVseg) 


5.1 ± 0.2 


5.4 ± 0.1 


6.3 ± 0.1 


5.9 ± 0.2 


Dsph (10""cmVseg) 


7.7 


8.0 


9.0 


8.45 


f/fo 


1.6 ± 0.1 


1.6 ± 0.1 


1.5 ± 0.1 


1.4 ± 0.1 



* M predicted from amino acids sequence for all proteins as dimers. 

* data from SV experiments fitted by SedFit software (see Material and methods for details); 
^ M calculated from s/D ratio as showed by Equation 4; 

data from [19]. 
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Figure 5 SAXS results for the proteins Sisi, Sis1_AT24-i74/ Sis1_AT2i-257/ and Ydj1_Aio6-255 (A) Scattering intensity curves for all the 
proteins. The inset shows the Guinier region showing the linear regression fit of the experimental data for all proteins. (B) Corresponding p(r) 
functions. (C) Kratky plots showing different degrees of flexibility for each protein. (D) Porod plots for the four proteins. A small plateau region in 
the curves of Sisl and Sisl_Ai24-i74 indicate a more defined molecular volume for those macromolecules. 



the Sisl peptide binding fragment as a homodimer in 
the crystal [20] as well as information on the modeled 
dimer structure of the Ydjl peptide binding fragment 
[25] were taken into account in our calculations. The q- 
range used in the DAMMIN model calculation was the 



one generally chosen for the application of this program 
(qmax- 8/Rg). The full q-range range (0.01<q<0. 25 A'l) 
was used for the DR and rigid body modeling routines 
(see Methods). Keeping in mind that SAXS ab initio 
modeling routines do not produce a unique solution, 



Table 2 Overall parameters derived from the SAXS results for the proteins Sis1, Sis1_Ai24-i74/ Sis1_Ai2i-257 and 

Ydj1_Al06-255 



Protein 



Structural properties 


Sisl 


Sis1_Ai24-174 


Sis1_Ai2i.257 


Ydj1_Aio6-255 


C (mg/mL) 


7.1, 3.1 


6.8, 4.2 


5.2, 3.9 


5.5, 2.75 


Rg (A) (Guinier) 


42 ± 1 


42 ± 1 


35 ± 1 


33 ± 1 


Rg (A) (from p(r)) 


43.3 ± 0.5 


42.6 ± 0.3 


35.2 ± 0.3 


34.2 ± 0.3 


Dm ax (A) 


160 


140 


125 


110 


M (kDa) (SAXS) 


-71 


-69 


-42 


-47 


Predicted M (kDa)* 


75.2 


66.1 


46.8 


56.1 


Oligomerization state 


Dimer 


Dimer 


Dimer 


Dimer 



* M predicted from amino acids sequence for all proteins as dimers. 
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ten independent runs were performed for each 
calculation. 

The resolution of the DA and DR models does not 
permit an unambiguous determination of the spatial 
positions of secondary structure elements, but they 
portray the overall recurring extended shape of the 
most frequent conformations adopted by these mole- 
cules in solution. A spherical start volume was used 
for the DA modeling calculation in order to minimize 
the generation of any specific direction for the p2 
symmetry introduced based on the fact that all mole- 
cules were dimers. Also, since we were dealing with 
flexible molecules, it was important to check if the DR 
and Rigid Body approaches (non initial-volume depen- 
dent) were giving similar results. Consequently, rigid 
body (RB) calculations were performed in an attempt 
to obtain information on the position of the J-domain 
and other domains with available crystallographic data 
to restore the protein structures based on SAXS data. 
In all cases, we tried to model the molecular envelope 
of the protein, performing calculations with 2-point 
symmetry constraints because of the proteins dimeric 
structure as explained above. Also, as previously men- 
tioned, the dimerization sites of all proteins were 
known a priori from their X-ray crystallographic 
structure and this information was used to impose 
dimerization contact conditions in the RB calculations. 
The position of the linker between the C-terminal 
domains and the J-domain of each protein was known 
from the amino acid sequences. Ten runs were per- 
formed for each set of calculations. The multiple runs 
gave almost coincident results and allowed the 



identification of the position of the individual domains 
in the RB models. 

The ab initio and rigid body models calculated for 
the full length Sisl protein and its deleted-domain 
mutants are presented in Figure 6. The DA (filter aver- 
aged), DR and RB models for proteins Sisl, Sisl_Ai24- 
174, Sisl_Ai2i-257> are shown in panels A, B and C 
respectively. In the case of the DR models, averaging 
does not add substantially to the results. So, the mod- 
els displayed are those presenting the lowest normal- 
ized spatial discrepancy (NSD) values, which also 
showed the best agreement with the DA model. NSDs 
tend to zero for nearly similar objects and when they 
exceed 1, the objects systematically differ from one 
another (as explained in the Methods section). The 
NSDs calculated can be considered reasonably good 
for the DA and DR modeling approaches on account 
of the flexibility and conformational changes of these 
molecules. The itemized NSD values for each DA, DR 
and RB models of Sisl, Sisl_Ai24-i74> Sisl_Ai2i-257> are 
listed in Table 3. 

The filtered average of the ab initio DA model, the 
DR model with the lowest NSD value and the rigid 
body model of the Ydjl_Aio6-255 protein are presented 
in Figure 7. These results were obtained following the 
same procedures previously described for the Sisl full 
length protein and its deleted domain mutants. The 
NSD values corresponding to these model calculations 
(Table 3) also were within the values being considered 
reasonably good for each different approach. The DR 
model with the lowest NSD value presents a remarkably 
good resemblance with the DA and rigid body models. 
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Figure 6 Results of the ab initio (DA and DR) model calculations as well as the rigid body models are displayed for the full length 
Sisl protein and its deleted mutants. (A) Sisl, (B) Sisl _Ai 24-1 74, (C) Sisl _Ai 21-257- 
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Table 3 Normalized Spatial Discrepancy (NSD) range obtained by pairwise comparison of the SAXS models in each 
group of calculations 



Modelling method 






Protein 




Sisi 


Sis1_Ai24-174 


Sis1_Ai2i-257 


Ydj1_Aio6-255 


Dummy atoms 


1 .00 - 1 .22 


1.17 - 1.41 


0.75 - 1.01 


0.88 - 1.21 


Dummy residues 


1.41 - 1.97 


1.84 - 2.68 


1 .47 - 1 .76 


1.38 - 1.56 


Ab initio and rigid body 


1 .95 - 2.87 


1.95 - 2.51 


1 .54 - 1 .85 


1.87 - 2.11 



and can be considered an excellent description of the 
Ydjl_Aio6-255 protein conformation in solution. 

Conclusions 

In previous studies, Ydjl and Sisl were found to be 
homodimers, whose structures differed in the context of 
the orientation of the J-domain in relation to the long- 
axis of the respective proteins [18,19]. These differences 
in structure correlate with functional differences exhib- 
ited by Ydjl and Sisl, and the G/F and G/M regions in 
these proteins were proposed to impact J-domain orien- 
tation [15]. To test this hypothesis we carried out 
hydrodynamic and low-resolution structural studies on 
Sisl and Ydjl deletion mutants. The deletion of the G/F 
region from Sisl and Ydjl had little impact on the func- 
tion of the J-domain in regulating Hsp70 ATPase 



Model ^ 




Figure 7 Results of the ab initio (DA and DR) model 
calculations and the rigid body model obtained for the 
Ydj1_Aio6-255 protein, using high resolution data available for 

some of its domains. 

\ J 



activity, but did decrease polypeptide binding activity. 
Surprisingly, deletion of the G/F region did not change 
the quaternary structure or overall flexibility of Hsp40s. 
However, deleting the ZFLR plus the CTDI domain in 
Type I Ydjl altered the conformation of J-domains from 
lying along the long axis of the molecule to be orien- 
tated in a crosswise direction as in Type II Sisl. Thus, 
orientation of J-domains in Type I Hsp40 may be con- 
trolled by the ZFLR-CTDI, which is consistent with 
results from domain swap experiments [15,19]. When 
these domains are missing, as in the deleted mutants 
studied here or in the Type II Hsp40s, the J-domains 
become oriented in a crosswise direction. Our results 
also point to interactions between the ZFLR-CTDI and 
the J-domains as an important factor in determining the 
unique quaternary structure of Type I Hsp40s. 

One interesting feature of Type I and Type II Hsp40s 
that is clear from our data is that these sub-types of 
Hsp40s are highly flexible. The SAXS intensity data 
obtained within a wider range of reciprocal space in the 
present experiments allowed a comparative analysis of 
the flexible nature of the proteins by means of the 
Kratky and Porod plots. These results showed that the 
Type I Ydjl and Type II Sisl Hsp40 proteins are highly 
flexible in solution and confirm the dimerization at the 
C-terminal. Besides, flexibility was observed in all con- 
structs studied, even after deleting a flexible region (G/ 
M). So, other flexible regions should exist in the protein 
in addition to G/M. Prior to the studies presented 
herein it was assumed that the G/F and G/M domains 
were the only flexible regions of Hsp40s that enabled 
the J-domain multiple paths to gain access to the Hsp70 
ATPase domain. However, Kratky and Porod analysis of 
SAXS data on Sisl and Ydjl deletion mutants show that 
both remain highly flexible in the absence of the G/F 
region. Hsp40s bind and deliver proteins that range 
from nascent monomeric polypeptides to amyloid-like 
aggregates to Hsp70. Thus, the overall flexibility of Type 
I and Type II Hsp40s detected may be important to per- 
mit delivery of proteins or different sizes or assembly 
states to the Hsp70 polypeptide binding site and allow 
for simultaneous interaction of the J-domains with 
Hsp70s ATPase domain [5]. These results are also in 
agreement with crystallographic structural studies 
reported by Hu et al. [27], in which they observed that 
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the CTDI of human Hsp40 may possess significant flex- 
ibiUty. In their work, the authors proposed an "anchor- 
ing and docking" model for Hsp40 in which the 
flexibiUty of the CTDI may be important for Hsp40 to 
regulate the size of the cleft in its interaction with non- 
native polypeptides and transfer them to Hsp70. 

Functions for the G/F and G/M regions found in Type 
II Hsp40 in regulation of Hsp70 ATPase activity and/or 
substrate binding are not clear. Yet, this is an important 
question because a Sisl fragment containing just the J- 
domain and G/F region is capable of rescuing the lethal- 
ity of the sisl A strain [22]. In addition, amino acid resi- 
dues in the G/M region appear to play a role in 
specification of Sisl function [28]. Studies with forms of 
Sisl in which the G/M domain was deleted show that 
loss of the G/M impairs the ability of Sisl to bind dena- 
tured luciferase, but have no effect on regulation of 
Hsp70 ATPase activity. The G/M region lies adjacent to 
the hydrophobic grove in CTDI implicated as a polypep- 
tide binding site and methionine has a hydrophobic side 
chain, which may help to build the proper binding site. 
Thus, the G/M region helps specify Type II Hsp40 func- 
tion by assisting in substrate binding. 

Since the effects on function are likely to be related to 
changes in the structure of the Hsp40s, one important 
objective of this work was to obtain low resolution mod- 
els of the mutants under study using the latest computa- 
tional methods available for the spatial representation of 
these molecules in solution using small angle scattering 
data. To date, there are crystallographic structures of 
isolated domains and the quaternary structure of 
Hsp40s is mainly known as a result of SAXS data from 
these proteins in solution, combined with other hydro- 
dynamic techniques. Previous structural studies have 
also proved that Type I Ydjl and Type II Sisl have dis- 
tinct functions and quaternary structures [15,19]. In this 
work, estimation of molecular masses from SAXS and 
AUC data indicated that the deletion mutants in solu- 
tion dimerized at the C-terminal, just like the full-length 
protein. Thus, it is clear that the C-terminal region is 
very important for dimerization of both Type I and 
Type II Hsp40s. 

Using three different low-resolution modeling meth- 
ods, we obtained molecular envelopes for each protein. 
The rigid body modeling method seemed to be the most 
appropriate for flexible proteins like Hsp40 and also elu- 
cidated the domain arrangement which is important to 
understand possible functions of the deleted domain 
mutants in the cell. Interestingly, even deleting some 
flexible domains of Sisl and Ydjl, the constructs kept 
their flexibility and maintained the clamp-like architec- 
ture of the full-length protein with the J-domains point- 
ing outwards in opposite directions. This architecture 



seems to be favorable to the interactions of Hsp40 with 
Hsp70. The models we built for the proteins seem to 
agree with the anchoring and docking model, proposed 
by Qian et al. [10], describing how Hsp40 facilitates the 
delivery of non-native polypeptides to Hsp70. 

Methods 

Protein expression and purification 

The recombinant protein Sisl and Ydjl were expressed 
and purified by two chromatographic steps as previously 
described [15,19,21]. Two Sisl mutants were prepared 
from DNA constructions and expressed in Escherichia 
coli BL21(DE3)pLys: Sisl_Ai24-i74 (pETllaSISl_Ai24- 
174), deleting from residues 124 to 174, and Sisl_Ai2i_ 
257 (pETllaSISl_Ai2i-257)> deleting from residues 121 to 
257. Additionally, one DNA construction for Ydjl 
mutant (pETllaYdjl_Aio6-255) deleted from residues 
106 to 255, was prepared and was also expressed in E, 
coli BL21(DE3) strain. Cells were grown at 37°C up to 
an optical density at 600 nm of 0.7. The temperature 
was reduced to 30°C and the protein expression was 
induced with 0.4 mM isopropyl thio-P-D-galactoside 
(IPTG) during 4 hours. Thereupon, the cells were har- 
vested by centrifugation during 10 min at 2,600 x g. 
The pellet was ressuspended in 50 mM Tris-HCl (pH 
8.0), 500 mM KCl and 1 mM EDTA (15 mL/L of LB 
medium). The cells were lysed by adding 30 (ig/mL of 
lysozyme (Sigma) and 5 U of DNAse (GIBCO BRL), 
kept for 30 min at ice bath, and then disrupted by soni- 
cation and centrifuged (30 min at 26,000 x g). 

The purification of the proteins was performed as pre- 
viously described [16,29]. Summarily, Sisl, Sisl_Ai24-i74 
and Sisl_Ai2i-257 were submitted to a cationic chroma- 
tography in a Macro-prep^™^ High S Support resin 
(BioRad) using an AKTA FPLC device (Pharmacia Bio- 
tech). The resin was equilibrated with 20 mM Tris-HCl 
buffer (pH 7.5) and 20 mM NaCl. Ydjl and Ydjl_Aio6- 
255 were submitted to an anionic chromatography in a 
Macro-prep^^^ High Q Support resin (BioRad) using an 
AKTA FPLC device (Pharmacia Biotech). The proteins 
were eluted by NaCl gradient, dialyzed overnight against 
buffer 20 mM Phosphate (pH 7.5), and further purified 
by chromatography in a CHT™ Ceramic Hydroxyapa- 
tite Type II resin (BioRad) at an AKTA FPLC (Pharma- 
cia Biotech). The target proteins were eluted by a 
phosphate gradient. Ydjl_Aio6-255 was further purified 
by a size exclusion chromatography in a Superdex 
200pg using an AKTA FPLC (Pharmacia Biotech) pre- 
viously equilibrated with 25 mM Tris-HCl buffer (pH 
7.5) and 500 mM NaCl. The efficacy of the purification 
was checked by 12% SDS-PAGE. Unless stated other- 
wise, all proteins were diluted in buffer 25 mM Tris- 
HCl (pH 7.5) containing 500 mM NaCl. 
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Circular dichroism 

Circular dichroism (CD) measurements were performed 
using a Jasco J-810 spectropolarimeter coupled to a Pel- 
tier-type System PFD 425S for temperature control and 
optimized for best performance as previously described 
[30]. The proteins were re-suspended in buffer 25 mM 
Tris-HCl (pH 7.5) containing 500 mM NaCl. Proteins con- 
centration ranged from 10 to 40 [iM and the spectra were 
collected at a scan rate of 50 nm/min with a spectral band- 
width of 1 nm and using a 0.2 mm path length cell. 

Chaperone activity 

Hsp40s activities were tested by their ability to bind 
heated denaturated luciferase and to stimulate Hsp70 
Ssal ATPase activity as previously described [15,21]. 
The ability to bind heated luciferase (a client protein) 
was tested and the Sisl binding was set as standard 
(100%). With respect to Hsp40s ability to stimulate the 
ATPase activity of Hsp70, the mutants were assayed 
regarding their ability to stimulate ATP hydrolysis of 
Ssal (Hsp70) and the Sisl stimulatory effect was set as 
standard (100%). 

Dynamic Light Scattering 

The experimental diffusion coefficient (D) was obtained 
by dynamic light scattering (DLS) using a DynaPro-MS/ 
X device (Protein Solutions). The experiments were per- 
formed at 20°C, and proteins concentration ranged from 
0.5 to 2.0 mg/mL. The D value was corrected to stan- 
dard conditions (D2o,w) and extrapolated to 0 mg/mL 
concentration (D^2o,w) in order to avoid effects of viscos- 
ity and temperature. D is related to the frictional coeffi- 
cient if) by the following equation: 



D : 



RT 

Na7 



(1) 



where T is the absolute temperature, R is the gas con- 
stant and Na is the Avogadro's number. 

For a protein with known Stokes radius (Rs) and visc- 
osity (r|), / can be obtained applying the Stokes equa- 
tion: 



/ = 67tr]Rs 



(2) 



For comparison, the frictional coefficient for a spheri- 
cal particle (/o) can be calculated using the predicted 
Stokes radius (Rq) for a smooth and compact spherical 
protein of molecular mass M: 



1/3 



(3) 



where Vbar is the partial specific volume, /o is used to 
obtain the maximum diffusion coefficient (D^ph) 



applying equation 1 and the frictional ratio (///o) is 
used to indicate particle asymmetry when compared to 
a globular protein of same M giving information on the 
shape of the proteins [26,31]. 

Analytical Ultracentrifugation 

Analytical ultracentrifugation (AUC) experiments were 
performed in a Beckman Optima XL-A analytical ultra- 
centrifuge. Sedimentation velocity (SV) experiments 
were carried out in concentrations ranging from 150 to 
1,000 (ig/mL. The SV experiments were performed at 
20°C, using 30,000 rpm (AN-60Ti rotor) for Sisl_Ai2i- 
257 and Ydjl_Aio6-255) and 25,000 rpm for Sisl_Ai24_i74. 
The SedFit software (Version 9.4) was used to deconvo- 
lute the sedimentation and diffusion data in order to 
obtain the continuous sedimentation distribution c(S) 
and a weight average value of frictional ratio. The f/fo 
value was used as a parameter of the regularization 
function and also used to estimate the molecular mass 
from the c(M) plots [32,33]. The apparent sedimentation 
coefficients (s) were obtained from the maximum peak 
values of the c(S) curves. The standard sedimentation 
coefficients (s2o,w) at each protein concentration were 
estimated to avoid interferences caused by viscosity and 
density increment [26,31]. The Sednterp software http:// 
www.jphilo.mailway.com/download.htm was used to 
estimate important hydrodynamic parameters: (1) the 
partial specific volume (Vbar) for Sisl (0.7263 mL/g), 
Sisl_Ai24-i74 (0.7312 mL/g), Sisl_Ai2i-257 (0.7284 mL/g) 
and Ydjl_Aio6-255 (0.7331 mL/g) from their amino acid 
sequence; (2) the Sgph and D^ph for a globular protein of 
same molecular mass M and buffer viscosity (r| = 1.0605 
xlO'^ poise) and density (p = 1.01938 g/mL); and (3) to 
correct the apparent value of s to S2o,w The standard 
sedimentation coefficient extrapolated to 0 mg/mL (s^2o, 
^) was calculated by linear regression from values of S20, 
w versus the protein concentration. The molecular mass 
values were obtained as the ratio of the sedimentation 
to diffusion coefficient using the following equation. 



M = 



sRT 



D(l - VbarP) 



(4) 



Small-Angle X-ray scattering experiments 

SAXS experiments were performed at the D02A-SAXS2 
beamline of the Laboratorio Nacional de Luz Sincrotron 
(LNLS, Campinas-SP, Brazil). The X-ray scattering data 
were recorded using a two-dimensional position-sensi- 
tive MARCCD detector. The measurements were per- 
formed with a monochromatic X-ray beam (wavelength 
of A. = 1.488 A) and a sample-to-detector distance of 
1374.4 mm, corresponding to the scattering vector 
range of 0.01 < q < 0.25A'^, where q is the magnitude 
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of the q-vector defined by (29 is the scattering angle). 
The samples were placed in a 1-mm path length cell 
with mica windows [34]. The scattering patterns were 
recorded at two different sample concentrations for 
each sample: 7.1 and 3.1 mg/mL for Sisl, 6.8 and 4.2 
mg/mL for Sisl_Ai24-i74> 5.2 and 3.9 mg/mL for 
Sisl 

-^i2i-257> ^i^d 5.5 and 2.75 mg/mL for Ydjl_Aio6-255' 
All the samples were measured in buffer 25 mM Tris- 
HCl (pH 7.5) containing 500 mM NaCl. Three succes- 
sive frames of 300s each were recorded for each sample 
and two frames more for the buffer. The scattering 
curves were individually corrected for detector response 
and scaled by the incident beam intensity and the sam- 
ples absorption. The corrected buffer scattering curve 
was subtracted from the corresponding sample scatter- 
ing. The resulting curves were normalized by the respec- 
tive concentrations and carefully inspected to check for 
possible radiation-induced damage and concentration 
effects, but such effects were not observed. A 5.6 mg/ 
mL bovine serum albumin (BSA, 66 kDa) solution was 
used as a standard sample to determine the molecular 
mass of the proteins. The molecular mass was estimated 
by comparison of the extrapolated value of the intensity 
at the origin value, 1(0), of the samples scattering data 
with that from the reference solution of Bovine Serum 
Albumin (BSA) as described in Orthaber et al. [35] and 
Mylonas et al. [36]. 

SAXS data analysis 

Determination of the radius of gyration (Rg) was per- 
formed using the Guinier approximation: 

Iiq)=I{0)expi-q^Rl/3) (5) 

valid in the q-range where qRg<l [37-39]. The linear- 
ity of the scattering curves in the validity region con- 
firmed monodispersity of the samples and allowed 
further analysis. Moreover, Rg values were also evaluated 
from the pair distance distribution function p(r) calcu- 
lated from the scattering intensity data by means of the 
Indirect Fourier Transform package GNOM [40]. The p 
(r) function also provided the maximum dimension 
Dmax of the molecule, because p (r > D^ax) = 0 [38,39]. 
Both the Guinier approximation, and the calculated p(r) 
function provided values for the forward scattering 
intensity 1(0). These 1(0) values were used for the esti- 
mation of the molecular mass. The confirmation of the 
monodispersity and dimerization of the Sisl, Sisl_Ai24- 
174, Sisl_Ai2i-257> and Ydjl_Aio6-255 proteins was 
inferred from the molecular mass obtained for these 
proteins. The flexibility of the molecules was estimated 
from the scattered intensity analyzing the behavior of 
the Kratky curves (q^I(q) versus q) [38,39] and Porod 
plots (q^I(q) versus q^). These representations provided 



qualitative information about the degree of flexibility 
and compactness [38,39,41-43]. 

Ab initio modeling based on SAXS data 

Ab initio calculations based on SAXS data were per- 
formed to obtain low-resolution models for the confor- 
mation of the following proteins: Sisl, Sisl_Ai24-i74> 
Sisl_Ai2i-257> and Ydjl_Aio6-255' Two different ab initio 
approaches were applied using the dummy atoms (DA) 
and dummy residues (DR) modeling methods. Given the 
existence of a certain flexibility in these proteins those 
two types of calculations were necessary to identify the 
existence of common structural features between the 
different models. The dummy atoms modeling method 
provided a bead model whose calculated intensity fitted 
the experimental SAXS curve (see Figure SI in Addi- 
tional file 1). This DA approach was implemented using 
the program DAMMIN [44] in the q-range (q^ax <8/Rg) 
which ends up being slightly different for each protein. 

Since no unique solution can be obtained from this cal- 
culation, several independent calculations were per- 
formed. Thereupon, the models were pair wise compared 
and then averaged using programs of the DAMAVER 
suite [45] and SUPCOMB [46]. The latter program aligns 
two models represented by ensembles of points by mini- 
mizing a dissimilarity measure called Normalized Spatial 
Discrepancy (NSD). Generally, NSD values tend to zero 
for increasingly similar objects; when they significantly 
exceed 1, the objects systematically differ from one 
another [46]. Subsequently, new calculations were per- 
formed for each protein using the DR approach. The 
dummy residues modeling method provides further 
insights into the possible three dimensional conformation 
of the proteins and their deleted mutants in solution. 
This was implemented using the program GASBOR [47] 
using the full range of q values. Again, several indepen- 
dent calculations were performed and the NSD values 
were evaluated. However, an average of the DR represen- 
tations does not substantially improve the quality of the 
models due to the flexibility of the molecules. So, a quan- 
titative analysis of the NSD values obtained from the sev- 
eral models was also performed in order to obtain the 
most appropriate molecular conformation (i.e., the model 
having the lowest average NSD value), to describe the 
low resolution structure of the protein. 

SAXS-based Modeling of the multidomain arrangement 
for all proteins studied 

The topology of each protein and each deleted-domain 
mutant of Sisl was examined by applying a rigid body 
modeling method to the SAXS data. This approach 
employs a simulated annealing protocol to find the opti- 
mal positions and orientations of high-resolution struc- 
tures of the known regions of the protein. At the same 
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time, the conformation obtained for the unknown 
regions (flexible Unker attached to the appropriate resi- 
dues of the domains) was calculated finding the best fit 
to the experimental scattering data. The rigid body cal- 
culations were implemented by the program BUNCH 
[48] using the full q-range. In order to construct the 
model of the mutants of Sisl, we used the high resolu- 
tion structure of the Sisl C-terminal peptide-binding 
domain and the Sisl J-domain, both found in the Pro- 
tein Data Bank http://www.rcsb.org identified by the 
codes 1C3G and 2037 respectively. To compose the 
model for the Ydjl deleted domain mutant, we used the 
high resolution structure of the Ydjl dimerization 
domain (PDB code IXAO) and also the J-domain from 
the Sisl structure (PDB code 2037). Several indepen- 
dent calculations were performed for each protein. We 
applied a 2-point symmetry constraint to the model cal- 
culations, using the a priori knowledge that all of them 
were dimers in solution. The NSD values were evaluated 
in order to select the most typical model for each pro- 
tein. The fits for the models (Figure SI) and the corre- 
sponding chi values (Table SI) can be found in 
additional files 1 and 2 respectively. 

Additional material 



Additional file 1: Figure SI. Model fits for the four proteir^s studied in 
this work. 

Additional file 2: Table S2. Chi-values (root mean square of x^) 
obtained from the fit of the theoretical scattering calculated from the 
models of the experimental intensity curve. 
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